The main issue in supporting host-sharing is the co-existence of the guest-pulling and host-sharing versions of the nydus snapshotter, as well as other, non-nydus snapshotters.
Before starting a pod sandbox, containerd will check if the pause image is present on the machine. If it is not, it will "pull it" with the help of the snapshotter. This "pull" is crucial, as the nydus snapshotter will export the right Kata virtual volume to handle it:
- `guest-pull`: it will generate a `GuestImagePull` virtual volume that makes the Kata Agent try to `pull_image` inside the guest (for the pause image this is just unpacking the bundle in the initrd).
- `host-share`: it will convert the OCI layers to `tarfs` layers, and generate virtual volumes that indicate the blobs to mount into the guest.

Consequently, if we skip the "pull" that happens during `ensureImageExists`, we never generate these virtual volumes, and execution fails. To make sure we pull when we need to, containerd needs to keep a per-snapshotter map of the images it has already pulled. Here's the catch: `guest-pull` and `host-share` are technically the same snapshotter. (This also applies to variations of the `host-share` mode: `image_block`, `layer_block`, and each one with `_verity`.)

The solution we will adopt is to install host-sharing as a "different" snapshotter (which we implement here), together with a patch in containerd that keeps track of what images have been pulled on a per-snapshotter basis (not on a global basis).
On top of that, we need to do some work on Kata, but most of it is already in kata-containers/kata-containers#7837. The only notable additions are to manually start the udev daemon, as we are still using the Kata Agent as `/init`, and to check that the (host-)mounted dm-verity hashes actually correspond to the layer digests. The latter will have to wait until we re-introduce image signature validation and attestation, as we need to get the ground truth from somewhere.

Another issue we ran into while testing: if two layers have the same digest, the tarfs module in the nydus-snapshotter will sometimes trigger an error due to a race condition.
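The deferred dm-verity check could eventually look something like the sketch below: each mounted layer's measured verity root hash is compared against ground truth keyed by layer digest. Where that ground truth comes from (signed metadata, attestation) is exactly the open question above; here it is just an in-memory map, and all names are illustrative:

```go
package main

import "fmt"

// checkVerityRoots compares the measured dm-verity root hash of each
// host-mounted layer against a trusted expectation, keyed by layer
// digest. Hypothetical sketch: the ground-truth map would have to be
// populated from signature validation / attestation.
func checkVerityRoots(groundTruth, measured map[string]string) error {
	for layerDigest, gotRoot := range measured {
		wantRoot, ok := groundTruth[layerDigest]
		if !ok {
			return fmt.Errorf("no trusted verity root for layer %s", layerDigest)
		}
		if wantRoot != gotRoot {
			return fmt.Errorf("verity root mismatch for layer %s", layerDigest)
		}
	}
	return nil
}

func main() {
	truth := map[string]string{"sha256:aaa": "roothash-1"}
	// Matching root hash: the layer is accepted.
	fmt.Println(checkVerityRoots(truth, map[string]string{"sha256:aaa": "roothash-1"}))
	// A tampered layer is rejected.
	fmt.Println(checkVerityRoots(truth, map[string]string{"sha256:aaa": "evil"}) != nil)
}
```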
Once this PR is merged in, both snapshotters should be usable, without having to restart them, by using `inv nydus-snapshotter.set-mode [guest-pull|host-share]`. The host-share mode uses `layer_block_with_verity`. With `guest-pull` mode, however, there is an open issue in the upstream repo regarding snapshotter restarts: containerd/nydus-snapshotter#631. This means that, in general, when changing the snapshotter mode it is safer to purge all snapshots first.

Purging also proved tricky, as it is not enough to remove the contents of `/var/lib/containerd-nydus*`: containerd keeps track of snapshot metadata in its metadata DB in `/var/lib/containerd/io.containerd.metadata.v1.bolt/meta.db`. We cannot easily delete elements from that DB, so after removing the snapshots manually, we wait for the GC to remove the corresponding entries.